A new approach to modeling excitation in very low-rate speech coding
نویسندگان
چکیده
A new method for two-band approximation of excitation signals in an LPC model, to improve speech naturalness in very low rate coding, is proposed. Based on a simpli ed model of Multi-Band Excitation, the method accurately determines the degree of periodicity, using the concept of Instantaneous Frequency (IF) estimation in frequency domain. The harmonic structure in the spectrum of LPC residual, within individual bands, is identi ed based on atness of the IF as a criterion for pitch and voicing detection. On this basis, the excitation is modelled by combining a prede ned periodic signal in the lower band and a random signal in the higher band. It is shown that this improves considerably the naturalness of reconstructed speech in very low rate coding in comparison with that obtained using traditional binary excitation [1]. The performance of the technique is also given in Temporal Decomposition (TD) based coding at 800 b/s.
منابع مشابه
Efficient excitation model and fast selection in CELP coding of speech
The paper discusses several new approaches for efficiently modeling and selecting the excitation in CELP coding of speech. Modified error criteria and structured codebooks lead to a wide range of complexity reduction methods, that are evaluated in terms of quality and computational requirements. A very low complexity, though high quality, Regular Pulse (RP) CELP technique is then derived. Final...
متن کاملLow-bit-rate Speech Coding
Low-bit-rate speech coding, at rates below 4 kb/s, is needed for both communication and voice storage applications. At such low rates, full encoding of the speech waveform is not possible; therefore, low-rate coders rely instead on parametric models to represent only the most perceptually-relevant aspects of speech. While there are a number of different approaches for this modeling, all can be ...
متن کاملVery low rate speech coding using temporal decomposition and waveform interpolation
In very low rate coding the aim is to accurately represent speech characteristics as efficiently as possible. High coding gains for the spectral features can be achieved through the use of temporal decomposition. Waveform interpolation coders accurately represent the excitation using characteristic waveforms (CWs) extracted at a constant rate. In this paper, the two approaches are combined into...
متن کاملA low resolution pulse position coding method for improved excitation modeling of speech transition
We propose a new excitation model for transitional speech to reduce the distortion due to the traditional two-excitation source, voiced and unvoiced, model. The proposed low resolution pulse position coding (LRPPC) algorithm detects the existence of pulses at frames of weak periodicity, which are determined as unvoiced, and transmits the approximate pulse positions. In the decoder, dispersed pu...
متن کاملStrategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec - Vision, Image and Signal Processing, IEE Proceedings-
This paper presents several strategies to improve the performance of very low bit rate speech coders and describes a speech codec that incorporates these strategies and operates at an average bit rate of 1.2 kb/s. The encoding algorithm is based on several improvements in a mixed multiband excitation (MMBE) linear predictive coding (LPC) structure. A switched-predictive vector quantiser techniq...
متن کامل